A Scalable Image Snippet Extraction Framework for Integration with Search Engines
Abstract
Search result visualization is the task by which a search engine helps users find the documents they want effectively and efficiently. An image-based summary of a web document (its best or most representative image), displayed as part of this visualization, has become indispensable because humans perceive images almost instantaneously. However, selecting the best image increases both the latency of search result generation and the workload of the search process. In this paper, we propose and implement a search framework that integrates text and image search engines to speed up the extraction of a representative image from a web document. The text associated with an image, the image's area, and its position are incorporated into the ranking function that selects the image snippet. By comparison, we show that our framework improves significantly on existing ones in terms of time complexity while maintaining the quality of the image-based summaries.
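The abstract names three signals that feed the ranking function: the text associated with an image, the image's area, and its position. The Python sketch below illustrates how such signals could be combined. It is not the paper's ranking function: the linear combination, the weights, the word-overlap text measure, and every name in it (`ImageCandidate`, `text_overlap`, `score`, `best_image`) are assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass
class ImageCandidate:
    """Hypothetical record for one image found in a web document."""
    url: str
    associated_text: str  # e.g. alt text, caption, nearby text
    width: int            # pixels
    height: int           # pixels
    top_offset: int       # distance from the top of the page, in pixels


def text_overlap(query: str, text: str) -> float:
    """Fraction of distinct query words that also occur in the text."""
    query_words = set(query.lower().split())
    if not query_words:
        return 0.0
    text_words = set(text.lower().split())
    return len(query_words & text_words) / len(query_words)


def score(c: ImageCandidate, query: str, page_area: int, page_height: int,
          weights: Tuple[float, float, float] = (0.6, 0.25, 0.15)) -> float:
    """Weighted sum of text, area and position signals (weights are assumed)."""
    w_text, w_area, w_pos = weights
    # Larger images score higher; the share of page area is capped at 1.
    area = min(1.0, (c.width * c.height) / page_area) if page_area > 0 else 0.0
    # Images nearer the top of the page score higher.
    pos = 1.0 - min(1.0, c.top_offset / page_height) if page_height > 0 else 0.0
    return w_text * text_overlap(query, c.associated_text) + w_area * area + w_pos * pos


def best_image(candidates: Sequence[ImageCandidate], query: str,
               page_area: int, page_height: int) -> Optional[ImageCandidate]:
    """Return the highest-scoring candidate, or None if there are none."""
    return max(candidates,
               key=lambda c: score(c, query, page_area, page_height),
               default=None)


logo = ImageCandidate("logo.png", "site logo", 100, 50, 0)
photo = ImageCandidate("photo.jpg", "A red bicycle leaning on a wall", 600, 400, 500)
chosen = best_image([logo, photo], "red bicycle", page_area=2_000_000, page_height=2000)
```

In this toy input the small logo sits at the top of the page but shares no words with the query, so the larger, query-matching photo is chosen. A real system would need to tune or learn the weights and use a stronger text relevance measure than word overlap.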
Similar resources
Pseudo-relevance feedback and statistical query expansion for web snippet generation
A (page or web) snippet is a document excerpt allowing a user to understand if a document is indeed relevant without accessing it. This paper proposes an effective snippet generation method. A statistical query expansion approach with pseudo-relevance feedback and text summarization techniques are applied to salient sentence extraction for good quality snip...
Distributed search based on self-indexed compressed text
Query response times within a fraction of a second in Web search engines are feasible due to the use of indexing and caching techniques, which are devised for large text collections partitioned and replicated into a set of distributed memory processors. This paper proposes an alternative query processing method for this setting, which is based on a combination of self-indexed compressed text an...
Scalable techniques for clustering the web pdf
Scalable Clustering. and text mining, spatial database applications, Web analysis, CRM, marketing. Powerful broadly applicable data mining clustering methods surveyed below. Since scalability is the major achievement of this blend strategy, this algorithm is. Using typical document clustering techniques on Web opinions produces unsatisfying results. In this work, we propose the scalable distance-ba...
Scalable Image Annotation by Summarizing Training Samples into Labeled Prototypes
As the number of images increases, it is essential to provide fast search methods and intelligent filtering of images. To handle images in large datasets, some relevant tags are assigned to each image to describe its content. Automatic Image Annotation (AIA) aims to automatically assign a group of keywords to an image based on the visual content of the image. AIA frameworks have two main sta...
Semantic snippet construction for search engine results based on segment evaluation
The result listing from search engines includes a link and a snippet from the web page for each result item. The snippet in the result listing plays a vital role in assisting the user to click on it. This paper proposes a novel approach to construct the snippets based on a semantic evaluation of the segments in the page. The target segment(s) is/are identified by applying a model to evaluate se...
Journal title:
- Computer and Information Science
Volume 6, Issue
Pages -
Publication date 2013